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We show how to analyze and interpret the correlation structures, the conditional expecta- 
tion values and correlation coefficients of exchangeable Bernoulli random variables. We study 
implied default distributions for the iTraxx-CJ tranches and some popular probabilistic mod- 
els, including the Gaussian copula model, Beta binomial distribution model and long-range 
Ising model. We interpret the differences in their profiles in terms of the correlation struc- 
tures. The implied default distribution has singular correlation structures, reflecting the 
credit market implications. We point out two possible origins of the singular behavior. 

KEYWORDS: correlation, calibration, Beta binomial, Gaussian copula model, default, depen- 
dency structure, Ising model 



1. Introduction 

Describing and understanding crises in markets are intriguing subjects in financial engi- 
neering and econophysics.^"^ In the context of econophysics, the mechanism of systemic failure 
in banking has been studied.^' ^ The Power law distribution of avalanches and several scaling 
laws in the context of the percolation theory were found. In addition, the network structures 
of real companies have been studied recently and their nonhomogeneity nature have been clar- 
ified. ^"^^ This feature should be taken into account in the modeling of the dependent defaults 
of companies. 

In financial engineering, many products have been invented to hedge the credit risks. CDS 
is a single-name credit derivative which is targeted on the default of one single obligor. Collat- 
eralized debt obligations (CDOs) are financial innovations to securitize portfolios of defaultable 
assets, which are called credit portfolios. They provide protections against a subset of total 
loss on a credit portfolio in exchange for payments. From an econophysical viewpoint, they 
give valuable insights into the market implications on default dependencies and the clustering 
of defaults. This aspect is very important, because the main difficulty in the understanding 
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of credit events is that we have no sufficient information about them. By empirical studies 
of the historical data on credit events, the default probability pd and default correlation pd 
were estimated. However, more detailed information is necessary in the pricing of credit 
derivatives and in the evaluation of models in econophysics. The quotes of the CDOs depend 
on the profiles of the default probability function. This means that it is possible to infer 
the default loss probability function from market quotes. Recently, such an "implied" loss 
distribution function has attracted much attention in the studies of credit derivatives. Instead 
of using popular probabilistic models, implied loss distribution are proposed to use.^^'^^ 

In this paper, we show how to get detailed information contained in probability functions 
for multiple defaults. We compare the implied loss probability function with some popular 
probabilistic models and show their differences in terms of the correlation structure. The 
paper is organized as follows. In §2 we start from the definition of exchangeable Bernoulli 
random variables and explain the term "correlation structures", the conditional expectation 
values and correlations. We introduce several notations of related quantities. Using the re- 
cursive relations, we show how to estimate them. We also point out that the method can be 
applied to any probability function of Bernoulli random variables, that are not necessarily 
exchangeable. In §3, we show how to infer the loss probability function for multiple defaults 
from the CDO market quotes by the entropy maximum principle. We compare the implied 
loss probability function with those of some popular probabilistic models in §4. The differ- 
ences become strongly apparent in the behavior of the conditional correlations. The singular 
behavior of the implied loss function should be attributed to the nonlinear nature of multiple 
defaults or network structures of companies. We also try to read the credit market implica- 
tions contained in the market quotes of CDOs and make a comment on "Correlation Smile" . 
Section 5 is devoted to the summary and future problems. In the appendix, we explain the 
relation between the profiles of probability functions and the correlation structures. 

2. Calibration of Correlation Structures 

In this section, we show a method of obtaining the "correlation structure" from the prob- 
ability function. We denote the i-th asset's (or obligor's) state by Bernoulli random variable 
Xi = 0,1 {i = ,N). If the asset is defaulted (or non-defaulted), Xi takes l(resp.O). 

We assume that XjS are exchangeable. The exchangeability means that the joint probability 
function of XjS is independent of any permutation of the values of XjS. Denoting the joint 
probability function as 

Prob.(Xi = xi,X2 = X2,--- ,Xn = xn) = P{xi,X2, ■ ■ ■ ,xn), 
the next relation holds for any permutation ii, i2, • • • , zat of 1, 2, • • • , A'^, 

P{xi , X2 5 ■ ■ ■ , 2; at) — P{Xi^ , Xjj ) ■ ■ ■ ) ) • 
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By assumption, the remaining degree of freedom in the joint probability function reduces to 
A^. The joint probability for i defaults and j nondefaults only depends only on i and j, and 
we denote it as X^j. The probability function for n defaults i'Ar(n) is written as 

Here jvC^ is the binomial coefficients. 

The term "correlation structure" means the conditional expectation values Pij and corre- 
lations pij . The subscript ij of pij and pij means that they are estimated under the condition 
that any i (resp.j) of N variables tale 1 (resp. 0). We also introduce qij as 1 —Pi,j- Po,o is the 
unconditional expectation value and it is nothing but the default probability pd- po.o is the 
unconditional default correlation p^. More detailed explanations about pij and pij are given 
in the appendix. These quantities satisfy the following relations^^ 

Pi+i,j = Pi,j + -Pi,j)pi,j, (2-1) 

qi,j+i = qi,j + {l-qi,j)pi,j, (2.2) 

Pi-ij-Pi,j-i = -{i - Pi-ij)Pi-ij - Pi,j-iPi,j-i- (2.3) 



N-2,0 



XXX 

N- 1 ,0 N-2, 1 N-3,2 
• • • • 



• • • • • 

X X X X 

^^,0 N-1,1 N-2,2 N-3,3 

Fig. 1. Solving process for Pi,j,qij and Pij from Xn.N-n- 



Using these recursive relations, it is possible to estimate pij and pij from PN{n) or 
Xn,N-n- Xn,N-n are On the bottom line of the Pascal triangle (See Fig.l). Then recursively 
solving the above eqs. (2.1)-(2.3) to the top vertex (0,0) of the Pascal triangle, we obtain all 
Pijs and Pijs. For example, to obtain p^^ifi we use the relations 

Xn,o = Xn-1,0 ■ PN-i,o and Xn-i,i = Xn-i,o ■ (1 - pn-i,o)- (2.4) 
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Solving for pN-i,i, we get 

PN-.,o = jr^ • (2.5) 

Likewise, we can estimate pij for general i, j < N — 1. Prom pij, pij are obtained by solving 
eq. (2.1). 

The important point is that it is possible to estimate the correlation structure for any 
PAr(n). In addition to theoretical models, empirically obtained probability functions can be 
studied. If X^s are exchangeable, the obtained pij and pij are the ones defined in the text. 
In network terminology, the exchangeable case corresponds to the complete graph i^jv where 
all nodes are similar to each other with the same strength. If the network structure is not 
uniform, XjS are not exchangeable. Even in such a case, the above method is applicable and 
gives many insights into the system. For example, if the network structure is treelike, the 
obtained correlation structure should be completely different from that in the exchangeable 
case. Its singular behavior strongly suggests the nonuniform network structure of the system. 

3. Implied Default Distribution 

We show how to infer the loss probability function based on market quotes of CDOs.^^' In 
advance, we briefly explain CDOs. CDOs provide protection against losses in credit portfolios. 
Here "credit" means that the constituent assets of the portfolio can be defaulted. If an asset 
is defaulted, the portfolio loses its value. The interesting point of CDOs is that they are 
divided into several parts (called 'tranches'). Tranches have priorities that are defined by 
the attachment point ul and the detachment point ajj. The seller of protection agrees to 
cover all losses between aiKj^otai and auKTotah where Kxotai is the initial total notional 
of the portfolio. That is, if the loss is below aiKxctah the tranche does not cover it. The 
tranche begins to cover it Only when it exceeds aiKTotal- If it exceeds anKTotaU the notional 
becomes zero. The seller of protection receives payments at a rate s on the initial notional 
{an — CLjKTotal - Each loss that is covered reduces the notional on which payments are based. 
A typical CDO has a life of 5 years during which the seller of protection receives periodic 
payments. Usually these payments are made quarterly in arrears. In addition, to bring periodic 
payments up to date, an accrual payment is performed. Furthermore, the seller of protection 
makes a payment equal to the loss to the buyer of protection. The loss is the reduction in the 
notional principal times one less the recovery rate R. 

iTraxx-CJ is an equally weighted portfolio of fifty CDSs on Japanese companies. The 
notional principal of CDSs is K and K^otal is 50 K. The recovery rate \s R = 0.35. The 
standard attachment and detachment points are {0%, 3%}, {3%, 6%}, {6%, 9%}, {9%, 12%} and 
{12%, 22%}. We denote them as {a^, a^^} with i = 1, • • • ,5. Table I shows the tranche struc- 
tures and quotes for iTraxx-CJ (Series 2) on August 30, 2005. Wc denote the upfront payment 
as Ui and the annual payment rate as si in basis points per year for the zth tranche. In the last 
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row, we show the data for the index that cover all losses for the portfolio. In the 6th column, 
we show the initial notional Nq in units of K. 

Table I. Quotes for ITraxx-CJ (Series 2) on August 30, 2005. Quotes are in basis points. Source: 
Tranche, Morgan Stanley Japan Securities Co. and Index, Bloomberg 
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The value of contract is the present value of the expected cash flows. For simplicity, we 
treat 5 years as one term and write T = 5 [year]. Wc also assume that defaults occur in the 
middle of the period. Wc denote the notional principal for the ith tranche outstanding at 
maturity as A'^. The expected payoff of contract is 

UiN^ + T<N^> SiC-''^ + {N^- < N't >)^e-'^5. (3.1) 

Here, < A > means the expectation value of A and r is the risk-free rate of interest. The 
expected loss due to default is 

(A^^- < iV^ >)e-''¥. (3.2) 

The total value of the contract to the seller of protection is given by cqs. (3.1)-(3.2). Risk 
neutral values of Sj and Ui are determined so that eq. (3.1) equals eq. (3.2). Conversely, the 
market quotes for Si and Ui tell us about the expected notional principal < Nj, >. We write 
them as implied- '^^^ column in Table I shows them from the market quotes Sj and Ui. 
Nj, are random variables and are related to the number of defaults n at maturity as 

Nkn) = \ anN-nil-R) l^]<n<\^] (3.3) 

Here, \x] means the smallest integer greater than x. To calculate the expectation value of 
Nj,{n), the default probability function PAr(n) is necessary. Inversely, using the data on these 
expectation values -^Tj^^^^g^, we try to infer PN{n) from the maximum entropy principle. 
It states that one should consider the model PN{n) that maximizes the entropy functional 
subject to the conditions imposed by previous known information. 
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The entropy functional S'[P/v(n)] is defined as 

N 

S'[P|v(n)] = ^ nCu ■ Xn,N-n ^OgXn,N-n 
n=0 

6 N 

,Implied )■ (3.4) 

j=l n=0 

In order to impose the condition < Nj: >= implied PN{n), we introduce six Lagrange 
multiphers A*. By maximizing 3.4, we get the imphed joint probabihty Xn,N-n as 

{^-Xen-X\a],N-n(l-R)) "qS^^ q n < n\j 

^-X,n-Xi+^ai+^N-n{l-R)) p^5^ .^^ ^. < n < n^' (3.5) 

e^-^sn n>n%. 

Here, we use the notation = [^^1 ^^'^ ~ exp(— A*iVj,(n)). 




Fig. 2. Plot of implied default distribution for fifty Japanese companies on August 30, 2005. 



The six Lagrange multiplier were calibrated so that the condition < Nj, >= implied 
satisfied. We use the simulated annealing method and fix these parameters. Figure 2 shows 
the result of fitting eq. (3.5) to iTraxx-CJ data on August 30, 2005. About the convergence, it 
is satisfactory and all premiums are recovered within 1%. Prom the inset figure, which shows a 
semilog plot of the distribution, we see a hunchy structure or a second peak. (n) decreases 
monotonically up to the fourth tranche (n < 9), then PN{n) begins to increase. In the fifth 
tranche n^^ = 10 < n < n\j = 17, PN{n) has a peak and then decreases to zero. We also 
see some joints between tranches at n]j. The latter is an artifact of the maximum entropy 
principle. 
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4. Comparison with Popular Probabilistic Models 

In this section, wc compare the behaviors of the loss probabihty function Pjv(n) of some 
popular probabilistic models with the implied loss distribution function from the viewpoint of 
the correlation structure. In particular, we focus on p.j.o and pi^. As probabilistic models, we 
consider the next three models. These models are defined by the mixing function /(p) that 
express the joint probability function Xij as^' 

Xi,j = J f{p')p'\i-p'ydp'. (4.1) 

We choose the Gaussian copula (GC) model, which is a standard model in financial engi- 
neering,^^ the beta binomial distribution (BBD), which is the benchmark model among ex- 
changeable correlated binomial models^^ and the long-range Ising (LRI) model. ^ The reason 
for adopting LRI, instead of the Ising model on some lattice, is that in financial engineering 
all obligors are usually assumed to be related to each other with the same strength and that 
the network structure is uniform. In addition, the long-range Ising model can be expressed as 
a superposition of two binomial distributions for sufficiently large N and it is very tractable. 

(1) Gaussian Copula (GC) Model. 

The model incorporates the default correlation by a common random factor Y and an 
asset correlation pa- If the factor Y is fixed asY = y, the variables Xi become independent 
with the probability Prob.(Xj = 1) = p{y). The explicit form of the mixing function is 

f{p{y)) = ^{^-^). (4.2) 

Vl - Pa 

Here, K = <^~^{pd) with the normal cumulative function $(iC) and Y obeys the normal 
distribution Y ~ iV(0, 1^). Xi j are then given as 

X,,j=<p{yy{l-p{y)y >Y . (4.3) 

< >y denotes the expectation value over the random variable Y. In order to estimate 
Pd, we use the relation pa = ^^j^- 

(2) Beta Binomial Distribution (BBD) Model. 

The mixing function f{p') is the beta distribution. 



■'^^^ B(a,/3) ^ ^ 



Here B(q;,/3) is the beta function. arc given as 

B( a + i,/3 + j 
B(a,/3) 

It is easy to show that po,o = Pd = ^ and po,o = Pd = a+l+i ■ 

We note that BBD is the benchmark model among exchangeable correlated binomial 
models, pij depend on i,j through the form i + j as pij = j^ip^^^j^- As the result, p^ 



7/16 



J. Phys. Soc. Jpn. 



Full Paper 




10 20 30 40 50 

n 

Fig. 3. Plot of implied default distribution (Fig. 2) and the probability functions of popular pricing 
models. Implied distribution (solid), GC (x), BBD (+) and LRI (dashed-dotted line). We set 
N = 50,p = 1.65% and pd = 6.55%. 



becomes a linear function of i for the fixed i + j = k as 

_ Pd(l - Pd) + i- Pd 
l + (fc-l)M ■ 

BBD is the "linear" model^^ that is why we call it the benchmark model. One can see 
the nonlinearity of other models by checking the differences of pij and pij from those of 
BBD. 

(3) Long-Range Ising (LRI) Model.^''' 

The mixing function f{p') is the superposition of two 6 functions 5{p'—p) and 6{p' — (l—p)). 

f{p') = (1 - a) • d{p' -p) + a5{jp' + (1 - p)). (4.6) 

Xi^j are given as 

X,,,- = (1 - a)p\l - pY + a(l - p) V. (4.7) 
It is easy to show that po,o = = (1 - a)p + a(l - p) and po,o = Pd = ' 

Figure 3 shows a plot of the implied distribution of the previous section with the probabil- 
ity function PN{n) of the above three models. The models have three parameters : the number 
of variables N, the default probability pa and default correlation p^. We set them with the 
same values of the implied distribution as N = 50, Pd = 1.65% and pd = 6.55%. We see that 
all three models give poor fits to the implied distribution. GC and BBD show a monotonic 
dependence on n. The LRI model has a nonmonotonic dependence and has a hump at n = TV. 

Next, we compare their correlation structures. Figure 4 depicts pi^ and pi^Q. pi^ for GC 
has a low peak and decays to zero slowly. BBD's pi^o decays slowly as pi^ = yh^- On the 
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40 

Fig. 4. Plots of pifi and pi^ (inset figure). 

other hand, LRI's pi^ rapidly increases to 1 and decays rapidly to zero. This behavior means 
that GC is weakly nonlinear and LRI is strongly nonlinear. 

As for pifi, recall the relation Pi+ifl = Pifl + (1 —Pifi)pi.o (cq.( 2.1)). With the same pd and 
Pd, we have po,o = Pd and pi^ =pd + {l -pd)pd- AH curves {i,Pifi) go through the two points 
{Q,Pd) and {l^Pd + (1 — As p^.o for i>l differs among the models and the implied one, 

the curves {i.pi.o) depart from each other for i >2. LRI's pi^Q rapidly increases to 1. pi^ also 
increases to 1 rapidly. For i = 3, pi^ ~ 1 and this means that all the obligors always default 
simultaneously if three of them are defaulted, which is the biggest avalanche. As the result, 
P/v(n) has a hump in its tail n = N. GC's pi^Q and BBD's pi^Q increase to 1 with i slowly. The 
distribution of the size of avalanches should be very wide and PN{n) comes to have a long 
tail. 

The implied one's pi^ has a medium peak at i = 1 and then rapidly decreases to zero for 
z > 5. Comparing it with those of BBD, we sec that the implied loss distribution function is 
nonlinear. Its behavior is completely different from those of both GC and BBD. pj^o increases 
rapidly with i compared with GC and BBD and soon saturates to some maximum value ~ 0.35 
at i = 5. The credit market expects that if more than 5 defaults occur, the obligors default 
almost independently. The size of an avalanche of simultaneous defaults is smaller than that of 
the Ising model. However, the probability that a medium-size of avalanche of defaults occurs 
is large compared with the GC and BBD. 

We also studied the correlation structures of the implied loss functions of iTraxx-Europe 
and CDX IG (U.S.A.), which arc CDOs of European and American companies {N = 125).^^ 
The implied distributions and pi^ are plotted in Fig. 5. The implied loss functions arc more 
complex than that of iTraxx-C J. pj^o shows the same singular behavior with those of iTraxx-CJ 
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Fig. 5. Implied default distribution on August 30, 2005. iTraxx-CJ (in solid), iTraxx-Europe (dashed) 
and CDX IG (+). 



Table II. Implied tranche correlations and entropy maximum correlation for 5-year iTraxx-CJ 

tranches on August 30, 2005. 

{0%,3%} {3%, 6%} {6%, 9%} {9%, 12%} {12%, 22%} Entropy 
13.5% 1.20% 2.58% 4.95 % 9.71 % 6.55 % 



and the singular behavior seems to be a universal property. 

About the origin of the singular behavior of pi^, we point out two possibilities. The 
first is that the probabilistic rule that governs the defaults of obligors is essentially new and 
nonlinear. The second is that the nonuniform network structure of the dependency relation of 
the obligors is reflected in pi^. If the network structure is not uniform, it affects the resulting 
correlation structure. As a result, pifi looks singular compared with those of the models on 
the uniform network. 

At last we comment on the tranche (compound) correlation, which is the standard corre- 
lation measure in financial engineering.^^ The method suggests the correlation so that the 
expected loss equals the expected payoffs for the i—th tranche, it is called "tranche correla- 
tions". The expected values are estimated with GC. Table II shows the tranche correlations 
for the quotes of iTraxx-CJ on August 30, 2005. In the last column, we show the maximum 
entropy value derived from the implied default distribution. As we showed previously, the GC 
gives a poor fit to the implied distribution. The tranche correlations are completely different 
from the entropy maximum value. In addition, it depends on which tranche the correlation is 
estimated. Such a dependence is known as a "correlation smile". We think that the "true" 
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default correlation is approximately given by the maximum entropy value and that tranche 
correlations are an artifact of using GC to fit the market quotes. As long as the probabilistic 
model gives a poor fit to the market quotes, the default correlation varies among the tranches. 
This is the origin of the "correlation smile" . 

5. Conclusions 

We show how to estimate the conditional probabilities pij and correlations pij from 
PN{n). BBD is the benchmark model among exchangeable correlated binomial models and 
Pij behave as pij = jqi^^^y^. If the obtained pij depends on which is considerably 
different from those of BBD, there are two possibilities. The first one is that the probabilistic 
rule that governs XjS is strongly nonlinear. The second one is that the assumption of the 
exchangeablity is wrong. The network structure of the dependency relation among XjS is 
nonuniform. 

We have inferred the loss probability function for multiple defaults based on the market 
quotes of CDOs and the maximum entropy principle. The profile is completely different from 
those of some popular probabilistic models, namely GC, BBD and LRI. pi^ has a medium 
peak and then rapidly decreases to zero for i >5. The origin of the singular behavior can be 
attributed to the above two possibilities. 

In order to clarify the mechanism of the singular behavior of pi^, it is necessary to study 
correlated binomial models on networks. In particular, the dependence of pi^o on the network 
structure should be understood. Recently, the authors have shown how to construct a linear 
correlated binomial model on networks in general. By applying the method of the present 
paper to the model, it is possible to understand the relation between the network structure and 
the correlation structures. More detailed studies of real companies' dependency structures have 
been performed recently.^'^ Instead of the implied loss function, a real loss distribution function 
has been estimated. Promoting these studies, we think that it is possible to understand the 
dependency structure of multiple defaults and to propose a theoretical model of the pricing 
of CDOs. 
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Appendix: Probability Function and Correlation Structure 

In this section, we explain the relation between PN{n) and the correlation structures pi^j 
and Pi J. We introduce the products of Xi and 1 — Xj, which exhaust all observables of the 
system. 



The following definitions are their unconditional and conditional expectation values (see Fig. 



i i+j 



(A-l) 



i'=i j'=i+i 



A-L). 



Xij — < Uij > . 



(A-2) 



Pi,j = < Xi^j+i\Ilij = 1 >=< Hj+ijlllij > 




(A-3) 



12/16 



J. Phys. Soc. Jpn. Full Paper 

qi,j = <i-Xi+,+i|n,j = i>=<n,,,+i|n,,,>=^^. (a-4) 

Here < A\B > means the expectation value of the random variable A under the condition 
that B is satisfied. Xo,o = 1, -^^1,0 = Po,o and Xq^i = 1 — po,o = Qofi- All information of the 
model is contained in ^ij. The joint probability P{xi,X2, ■ ■ ■ ,xn) with X^^^^ Xj' = n is given 
by Xn^N-n- The probability function PNiji) is given as 

N 

PN{n) = Prob(y^ Xi = n) = nC„ ■ Xn,N-n- 

i=l 




Fig. A-1. Pascal's triangle representation of Xij up to i + j < 2 and Pij,qij ■ ^0,0 =< 1 >, 
Xifi =< Xi >= p, Xq^i =< I - Xi >= 1 - p = q etc. 



We also introduce the conditional correlation 

Corr(Xj_|-j+i,Xj+j+2|nij = 1) = Pij. 
The correlation between Xi and Xj is defined as 



CoTr{Xi,Xj) = 



< XiXj > - < Xi >< Xj > 



(A-5) 



(A-6) 



y< Xi > (1- <Xi>)< Xj > (1- < Xj >)■ 
Its conditional ones are defined by replacing expectation values with conditional expectation 
values. 

The conditional quantities Pi,j,qij and pij must obey the recursive relations from eqs. 
(2.1)-(2.3). The reason is that the following two relations must hold for the system to be 
consistent. The first one is pij + qij = 1 for any because of the identity < l\Iii,j = 1 >=< 
Xi+j+i + (1 — Xi+j+i)\Ilij 



1 >= 1. The second one is the commutation relation 



qi+i,j ■ Pi,j - Pi,j+i ■ qi,j 



x„ 



(A-7) 



These two relations are guaranteed to hold when pij, qij and pij satisfy the above consistency 
relations. 
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We explain the meaning of these quantities. The first one is po,0) the unconditional ex- 
pectation value of Xi. Its meaning is clear and it is the probability that Xi takes 1. In the 
context of a credit portfolio problem, it is the default probability p^- It is easy to estimate it 
from P/v(n) as 

Pd = Pofl =< Xi >=< n > /N. (A-8) 

The unconditional correlation /9o,o is the default correlation pa in the credit risk context. It is 
also easy to estimate it as 

<n^-n> /N(N - I) - pj, , 

Pd = Po,o = n \ ■ 

Pd{^ - Pd) 

Its estimation is important in the evaluation of the prices of credit derivatives. One reason is 
that it is related to the conditional default probability pi^o from eq. (2.1) as 

Pifi = + (1 - Pd)pd- (A-10) 

If one obligor is defaulted, the default probability p^i changes to pi^Q. The second reason is 
that it gives the simultaneous default probability for Xi and Xj as 

Prob.(Xi = 1, Xj = 1) = pl+Pd{l - Pd)Pd- (A-11) 

Usually, Pd is small and the simultaneous default probability is mainly governed by the second 
term. 

Regarding pi^ with z or j > 0, we note one point. Prom the definition, means the 
default probability under the condition Hi j = 1. pjj also means the default correlation in 
the same situation, pi^j^ with I > i and m > j are closely related to the default probability 
function P]^_(j^^j-^{n — {i + j)\Ilij = 1). We write k = i + j and the next relation holds for 
n> k. 

n~k N—n 

PN-k{n - k\Iiij = 1) = N-kCn-k- < n ^k+l n (1 - ^n+m) in^j = 1 > (A-12) 

1=1 m=l 

We evaluate the expectation value with „j and g^^^, and we get 

n-fe-l N-n-1 

PN-k{n - k\Ilij = 1) = N-kCn-k ■ Yl P-'+l'j n ^n-k+i,j+m n > k. (A-13) 

1=0 m=0 

This relation indicates that the Pascal Triangle with the vertex (i, j), {N — j, j), and (i, N — i) 
contains all information for the case Iljj = 1 (See Fig. A-2). In order to know the loss 
probability function under the condition Hij = 1, we only need to know pi^^n ^^id qi^^ in the 
restricted Pascal Triangle. 

The i-dependence of pi^ and pi^Q is closely related to the behavior of the probability 
function PN{n) for n > i. By the relation, we can understand the cascading structure of the 
simultaneous defaults. Hereafter, as we are interested in the credit risk problem, we assume 
that po,o = Pd is small. 
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First, we note that P/vl"-) can be expressed in the following form for n>i. 

PN{n) = -J^^IL- . Xifl ■ PN-i{n - i\Ui,o = 1). (A-14) 

N—i^n—i 

The derivation is based on the following relation. 

n N 

PN-i{n - i\Ui^O = 1) = N-iCn-iX < n n ~ Xm)\lk,0 = 1 > 



l=i+l m=n+l 

N 



iCn-iX <\{Xi J] (1 - x^)|ni,o = 1 > 



Z=l m=n+l 
n N i 

= N-iCn-iX <llXi II {1-Xm)> / <llXi> 
1=1 m=n+l 1=1 

= ^_,Cn-iX^^/Xi,o. (A-15) 

Equation (A-14) tells us about the behavior of PN{n) for n > i 
We classify the behavior into two cases. 

(1) Short-tail case : 

The probability function PN{n) develops a short tail in the case where rapidly de- 
creases with i and pij = for i > k and j < i — k with k « N. For the case Iljj = 
with i > k and j < i — k, all variables become independent. Xij is estimated as 

Xi,j = Xk,o- < I^i,j\I^k,o = >= Xkfi ■ p]^q • • • ^,o- 
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The probability function Pjv(^) for n > k becomes 

PN{n) = nCu ■ ■ ■ q^^Q . 
Pk,0 

PN{n) becomes proportional to the binomial distribution Bi(A^,pfc o) and has a short tail. 
It has a hump at n ~ • Pkfi- particular, if pjt q — the probability function has a 
hump ak, n = N . 

(2) Long-tail case : 

The probability function has a long tail in the case where pi^Q is small and gradually 
decreases with i. The random variables are weakly coupled. The z-dependence of is 
given by Pi+i,o = Pi,o + (1 — Pi,o)pi,o and pi^ gradually increases with i. If we assume 
Pi,o = 0) PN{n) becomes proportional to the binomial distribution Bi(iV,pj_o) for n> i. 

PN{n) = NCn-^-plo-q^i'' for n>i. 

Pi,o 

However, pj q is not zero and Pi^ gradually increases with i. For n > i + i-jv(n) behaves 
as 

PNin) = NCn-^^-p?+i,o-<l^o~'' for n>i. 
= Xifl ■ pifi, we have 

n / \ -^1,0 Pifi n N—n a \ ■ 

Pjv(n) = nCu ■ '— ■ P^+ifi ■ Qi^Q " for n> i. 

Pi+lfl Pi+lfi 

As Pi+io > Pio, the overall scale "f''° • is smaller than Apart from the 

overall factor, P/v(n) becomes proportional to Bi(A'^,pj^o) with a larger pi^Q for a larger i. 
Compared with that in the short-tail case, the decrease in Pjv(n) with n is milder and 
PAr(n) has a longer tail. 
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